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Abstract 

The following problem is NP-hard: given a regular expression E, decide if E* is not 
co-finite. This problem is also in PSPACE. 

1 Introduction 

Given k positive integ with gcd(xi, x%, • • • , Xk) = 1, the Frobenius Problem 

is to find the largest integer that cannot be represented as a non-negative integer linear 
combination of the given integers. This largest integer is called the Frobenius number of the 
given integers, and is denoted as g(xi,X2, • • • , One can refer to [1] for a good survey on 
the Frobenius Problem. 

There have been different generalizations of the Frobenius Problem in the literature. One 
generalization of this problem to a free monoid as follows: [3] given a finite set S of words 
on a given alphabet, find the (length of) the longest word(s) not in S* if S* is co-finite. One 
natural variation of this Monoid Frobenius Problem is to give the input set of words S in a 
compressed way. 

Problem 1. Given a regular expression E, find the length of the longest word(s) not in E* 
if E* is co-finite. 

2 NP-hardness Proof 



The NP-hardness of the following decision problem will be proved by giving a polynomial 
reduction from 3SAT, using similar techniques in the NP-completeness proof of star-free 
regular expression inequivalence. 

Problem 2. Given a star-free regular expression E, decide if E* is not co-finite. 

Let U = {u\, U2, ■ • ■ , u n } be a set of variables and C = {ci, C2, • • • , c m } be a set of 
clauses making up an arbitrary instance of 3SAT. Without loss of generality, suppose each 
variable appears in at least one clause. Then n < 3m. For each clause c«, a star-free 
regular expression = u\u\ • • • u l n can be constructed, where — F if Uj appears in q, and 
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u l j = T if Uj appears in q, and u % - = (T + F) otherwise. Let E' = e\ + e 2 + • • • + e m , = 
(T + F)(T + F) — - (T + F) = (T + F) n , E = E' + £"'(T + F). It is easy to check this 
construction can be performed in polynomial time. It remains to see that, in the above 
construction, the clauses C are satisfiable if and only if E* is not co-finite over the alphabet 
£ = {T, F}. The following lemma is required. 

Lemma 3. |5j/ Suppose S C S' m U S n , < m < n, and 5* is co-finite. Then S m C 5. 

Let 5 be the set of words in E. Then S n+1 C S C S n U S n+1 , which follows by the 
construction of E. If C is satisfiable, then one can check that E' ^ E", so Y7 1 (I S. 
Therefore, S* cannot be co-finite. If S* is not co-finite, since gcd(n, n + 1) = 1, then E n S, 
which leads to E' ^ E" . So, C is satisfiable. This finishes the NP-hardness proof of Problem 

El 

As seen in the given polynomial reduction, restricting Problem |2] to the binary alphabet 
and/or demanding that the language of E consist of words of only two different lengths 
results in a problem that is also NP-hard. The following problems can also easily be seen to 
be NP-hard, as consequences of the NP-hardness of Problem [2j 

Problem 4. Given a regular expression E, decide if E* is not co-finite. 

Problem 5. Given a NFA M, decide if S* is not co-finite, where S = L(M). 

3 PSPACE Proof 

Both Problem Hand Problem [5] are in PSPACE. A NPSPACE algorithm for Problem H will 
be presented, and the NPSPACE proof for Problem [5] follows in a straightforward manner. 
Since NPSPACE = PSAPCE by Savitch's theorem, the result will follow. 

Let E be an arbitrary regular expression, and t be the length of E. An NFA M can be 
constructed to accept E* with at most t + 1 states [5j. Let n = 2 t+1 . Then, nondeterminis- 
tically guess a word w of length i for n < i < 2n, and verify that it is rejected by M. E* is 
not co-finite if and only if there exists aw of length i for n < i < 2n such that M rejects 
w. To verify the word w is rejected by M, a boolean matrix of dimension at most t + 1 is 
stored to keep track of reachability. The entry (p, q) is true if and only if q is reachable from 
p on a given word. At the beginning, this matrix is initialized as the identity matrix, for the 
word e. In each step, the matrix is updated to process one guessed letter. This requires only 
polynomial space. 

To prove the correctness of this algorithm, notice that an NFA can be constructed to 
accept the language E* with at most t + 1 states. Hence a DFA M' accepting E* has at 
most n = 2 t+1 statesjl]. Interchanging final and nonfinal states, the DFA M" for E* has 
at most n states. By a classical result [2J, M" accepts an infinite language if and only if it 
accepts a word of length i, n < i < 2n. Hence M"s language is co-infinite if and only if it 
rejects some word of length i, n < i < 2n. 

appendix 

This report is to be a part of a future paper. 



2 



References 

[1] J. L. Ramirez Alfonsm: The Diophantine Frobenius Problem. Oxford University Press, 
2005. 

[2] J. Hopcroft and J. Ullman, Introduction to Automata Theory, Languages, and Compu- 
tation, Addison- Wesley, 1979. 

[3] Jui-Yi Kao, Jeffrey Shallit, Zhi Xu: The Frobenius Problem in a Free Monoid. 
STACS2008: pp.421-432. 

[4] Sheng Yu, Qingyu Zhuang, Kai Salomaa: The State Complexities of Some Basis Opera- 
tions on Regular Languages. Theoretical Computer Science, Volume 125, 1994: pp. 315- 
328. 

[5] Markus Holzer, Martin Kutrib: Nondeterministic Descriptional Complexity of Regular 
Languages. International Journal of Foundations of Computer Science, Vulume 14, 2003: 
pp.1087-1102. 



3 



